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The paper focuses on the adaptation of local polynomial filters at 
the end of the sample period. We show that for real time estimation of 
signals (i.e., exactly at the boundary of the time support) we cannot 
rely on the automatic adaptation of the local polynomial smoothers, 
since the direct real time filter turns out to be strongly localized, 
and thereby yields extremely volatile estimates. As an alternative, 
we evaluate a general family of asymmetric filters that minimizes the 
mean square revision error subject to polynomial reproduction con- 
straints; in the case of the Henderson filter it nests the well-known 
Musgrave's surrogate filters. The class of filters depends on unknown 
features of the series such as the slope and the curvature of the under- 
lying signal, which can be estimated from the data. Several empirical 
examples illustrate the effectiveness of our proposal. 

1. Introduction. One of the key issues economists have faced in charac- 
terizing the dynamic behavior of macroeconomic variables, such as output 
and inflation, is separating the longer-term component from the transitory 
one. Key measurements such as dating the business cycle turning points and 
more generally the assessment of the underlying trend call for signal extrac- 
tion methods that separate the two components. Many methodologies are 
available for the task, ranging from nonparametric methods based on the 
notion of a band-pass filter and on wavelet methods [Percival and Walden 
(2000)], kernel estimation and local polynomial modeling [see, e.g., 
Fan and Gijbels (1996)] semiparametric methods based on spline smooth- 
ing and mixed models [see Ruppert, Wand and Carroll (2003) and Proietti 
(2007)], and parametric methods based on the state space models or the 
Wiener-Kolmogorov signal extraction theory [Whittle (1983)]. An essential 
and up to date monograph on measuring trends and cycles in economics 
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is Mills (2003). A problem that is common to all the methodologies is the 
reliability of the trend estimates at the end of the sample period. 

The concern of the paper is real time estimation of the underlying trend 
in a time series by means of filters that arise from fitting a local polynomial 
of a given degree with a constant bandwidth. Real time estimation is of 
outmost importance in fields like economics and deals with the estimation 
of a signal at time t using the observations available up to and including 
time t. 

A well-known property of local polynomial estimators is the automatic 
adaptation at the boundaries. It essentially means that the bias and the 
variance near the boundary are of the same order of magnitude as in the in- 
terior. See, for instance, Hastie and Loader (1993), Fan and Gijbels (1996), 
Section 3.2.5, Simonoff (1996), Section 5.2.3 and the references therein. 

It turns out, however, that for a local cubic fit, such as that arising from 
the well-known Henderson filter [Henderson (1916)], the variance inflation 
resulting from the one-sided real time direct filter is very high, and that the 
filter is strongly localized at the current observations, with a leverage that 
is close to unity. 

The paper documents this basic feature and will be concerned, in partic- 
ular, with the evaluation of alternative strategies aiming at the adaptation 
at the boundary of a given two-sided symmetric local polynomial filter. Our 
discussion will mostly refer to the Henderson filter. The latter has a long 
tradition for trend-cycle estimation in economic time series. The relevance 
of Henderson's contribution to modern local regression is stressed in the 
first chapter of Loader (1999). Henderson filters are still employed for trend 
estimation in the X-ll cascade filter, and as such are an integral part of 
the X-12-ARIMA procedure, the official seasonal adjustment procedure in 
the US, Canada, the UK and many other countries. See Dagum (1980), 
Findley et al. (1998) and Ladiray and Quenneville (2001) for more details. 

The plan of the paper is the following. Section 2 motivates the applied 
problem of interest, presenting an example dealing with the assessment of 
recent business conditions in the US housing markets. After reviewing the 
constructive principles presiding the derivation of the two-sided symmetric 
local polynomial filters, Section 3 provides a thorough assessment of the 
properties of the asymmetric filters automatically adapted at the boundary, 
which result from fitting a local polynomial with a fixed bandwidth to the 
observations available at the current time. The direct asymmetric filters 
can be equivalently derived through the reproducing kernel Hilbert space 
method, as we prove based on the Hankel representation of a reproducing 
kernel in the context of weighted least squares estimation. The key result, 
as we stressed above, is that the real time filter behaves differently from the 
other automatically adapted asymmetric filters inside the boundary. 
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Section 4 evaluates an alternative class of fixed bandwidth asymmetric 
niters that result from minimizing the mean square revision error subject to 
polynomial reproduction constraints. This class generalizes the well-known 
Musgrave's asymmetric approximation of the Henderson filters [Musgrave 
(1964), see also Doherty (2001), Gray and Thomson (2002) and 
Quenneville, Ladiray and Lefrancois (2003)], which is implemented in the 
seasonal adjustment filter X-ll, developed by the US Census Bureau [see 
Findley et al. (1998) and Ladiray and Quenneville (2001)]. The class of fil- 
ters depends on the properties of the true underlying signal, namely, its 
level, slope, curvature and so forth, which can be estimated from the data. 

In Section 5 we provide a few illustrations dealing with economic time 
series. They address the issue of approximating the Henderson filter in real 
time and show that the slope and curvature play a relevant role for the 
derivation of the optimal real time approximation. These features can be 
estimated from the available data. The two features are, on the contrary, 
neglected by Musgrave's asymmetric filters, which postulate that the true 
underlying signal is linear but only require that the approximate filter is 
capable of reproducing a zero degree polynomial. In Section 6 we draw our 
conclusions. 

2. A motivating example: assessing recent trends in the housing market. 

Figure 1 displays the monthly time series of housing starts, for the period 
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Fig. 1. New Privately Owned Housing Units Started, US Source: Census Bureau. Orig- 
inal series and two-sided nonparametric trend estimates obtained by the Henderson filter. 
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January 1959-October 2007. The series, published by the US Census Bu- 
reau, concerns the number of privately owned new housing units on which 
construction has been started over the reference period. See the US Census 
Bureau website at www.census.gov for further documentation. 

Housing starts represent an important indicator of the state of the econ- 
omy. In a recent paper Learner (2007) argues that residential investment 
offers the best early warning sign of an oncoming recession. It is evident 
from Figure 1 that housing starts peaked at the end of 2005, and underwent 
thenceforth a very steep decline. The analyst is typically interested in the 
timely assessment of the turning point and of the most recent trends, in 
a noisy environment. The estimation of turning points typically requires a 
trend-cycle estimate, before the application of a dating algorithm, such as 
a Bry and Boschan (1971) routine, which is widely popular in economics. 
This operation is necessary in order to prevent high frequency fluctuations 
from interfering with the identification of turning points, producing many 
false candidates. 

The dashed line overlaid to the plot is the nonparametric estimate of the 
trend-cycle component produced by local cubic regression using a particular 
kernel, the two-sided Henderson kernel, which is discussed in more detail 
and contextualized in the next section. The estimates are computed on 21 
consecutive monthly observations: the observation at the time of interest 
and 10 observations on each side of it (the bandwidth has been selected by 
cross-validation). As pointed out in the introductory remarks, this is only 
one of the possible solutions to the signal extraction problem. Another pos- 
sibility would be to estimate the component of interest by postulating a 
(semi) parametric model for it, for example, an integrated random walk, an 
ARIMA model or a smoothing spline. Be that as it may, the estimation at 
the end of the sample period is a delicate issue for any signal extraction 
method. Think, for instance, to wavelet multiresolution analysis: the tradi- 
tional solution to the problem of handling boundary conditions, based on 
the circularity assumption [see Percival and Walden (2000), pages 197-199], 
is implausible here due to the nonstationary nature of the series. 

Turning back our attention to our local polynomial approach, it is evident 
from the plot that the two-sided estimates of the signal are not available for 
the last 10 months, which are the most interesting from the point of view of 
the business cycle analyst and the policy maker. Actually, a direct solution 
is readily available: it arises from the automatic adaptation of a local cubic 
polynomial to the available observations at the end of the sample, using the 
same bandwidth (or a nearest neighbor bandwidth) and the same kernel. 
However, we shall argue in the paper that the corresponding estimates are 
inherently too volatile. This feature is visible from the plot of the real time 
estimates (i.e., the one-sided estimates using the current observation and 10 
past observations) arising from the direct asymmetric adaptation, which are 
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Fig. 2. New Privately Owned Housing Units Started, US Source: Census Bureau. End 
of sample estimates of the trend-cycle component obtained by two asymmetric adaptations 
of the Henderson filter. 

displayed in the left panel of Figure 2 (solid line) along with the original time 
series observations (dots) and the final two-sided Henderson filter estimates 
(dashed line). These estimates are very rough; they are close to the original 
observations and far away from the final two-sided estimates. As such, they 
potentially give rise to a large number of false candidate turning points. 
Moreover, the revision of the estimates as new information becomes available 
is substantial, as the comparison with the final Henderson estimates reveals. 

The right panel presents the real time estimates produced by our pro- 
posed boundary filter. The plot reveals that they are more stable and more 
in line with the final ones. The proposed filter is derived according to the 
principle of minimizing the mean square revision error (the mean square 
deviation from the final Henderson estimates), subject to the condition that 
the asymmetric filter reproduces without distortion a linear trend and mak- 
ing the assumption that outside the sample period the underlying signal is 
a quadratic function of time (i.e., is a lower order polynomial outside the 
boundary of the sample space). Full details will be given in the sequel; it 
suffices to say at this point that we introduce bias, in order to reduce the 
variance of the estimates. This strategy proves effective for a large class of 
economic time series considered, as it will be illustrated further in Section 5. 

We can draw here an analogy with the natural boundary conditions that 
are employed in cubic spline smoothing. Also in that framework, the nat- 
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ural boundary conditions imply that the underlying signal behaves differ- 
ently outside the boundary knots. In particular, the cubic spline is linear 
outside the boundary knots; see, for example, the discussion in 
Ruppert, Wand and Carroll (2003), page 72. In our case the assumption 
that the trend-cycle component has a lower order representation outside the 
sample period has similar statistical rationale, being designed to optimize 
the estimation bias-variance trade-off at the end of the sample space. 

3. Local polynomial filters and the Henderson filter. Filters that arise 
from fitting a local polynomial have a well established tradition in time series 
analysis and signal extraction; see Anderson (1971), Chapter 3, Kendall 
(1973), Kendall, Stuart and Ord (1983), and the excellent historical review 
in Cleveland and Loader (1996). In this section we review the derivation of 
linear smoothers for trend extraction. 

Let us assume that time is discrete and that the series can be decomposed 
as yt = fJ4 + £t, where fit is the signal (trend) and et ~ NID(0, a 2 ) is the noise. 
The signal is approximated locally by a polynomial of degree d, so that in 
the neighborhood of time t we can write 



y t+j =m t+j +e t+ j, 



mt+j 



Po + Pij + p2f 



j = 0,±l, 



,±h. 



In matrix notation, the local polynomial approximation can be written as 
follows: 
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Using this design, the value of the trend at time t is simply given by the 
intercept, m t = (3q. Provided that 2h > d, the d + 1 unknown coefficients 
/3k, k = 0, .. .,d, can be estimated by the method of weighted least squares 
(WLS), which consists of minimizing with respect to the (5k s the objective 
function: 



(2) S((3 ,...,Pd) 
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Here, Kj > is a set of weights that define, either explicitly or implicitly, a 
kernel function. 

Reverting to the matrix notation, setting K = diag(ft_/j, . . . , 
K-i, Ko, K\, . . . , Kh)-, the WLS estimate of the coefficients is (3 = (X'KX) _1 X'Ky. 
In order to obtain rht = (3$, we need to select the first element of the vector 
(3. Hence, denoting by ei the d + 1 vector e[ = [1, 0, . . . , 0] , 

h 

ra f = e' 1 ^ = ei(X'KX)- 1 X'Ky = w'y= £ w jVt -j, 

i=-h 

which expresses the estimate of the trend as a linear combination of the 
observations with coefficients 

(3) w = KX(X'KXr 1 ei. 

The linear combination yielding the trend estimate is the local polynomial 
two-sided filter. It satisfies X'w = ei , or equivalently, 

h h 

H w i = !) = °' r = 1, 2, . . . , d. 

j=-h j=-h 

As a consequence, the filter w is said to preserve a deterministic polynomial 
of order d. Moreover, the filter weights are symmetric (w,- =w_j), which 
follows from the symmetry of the kernel weights Kj, and the assumption 
that the available observations are equally spaced. 

The Henderson filter [see Henderson (1916), Kenny and Durbin (1982), 
Loader (1999), Ladiray and Quenneville (2001)] arises as the weighted least 
squares estimator of a local cubic trend at time t using 2h + 1 consecutive 
observations. When d = 3, the weights in (3) take the form 

(Sa ~ S 2 j 2 ) 



3 — 0, ±1, • • • , i/i, 
O0O4 — <->2 

where S r = Yl!j=-h K jJ r ■ This expression makes the dependence on the kernel 
weights, Kj, explicit. Henderson (1916) addressed the problem of defining a 
set of kernel weights that maximize the smoothness of the estimated trend, in 
the sense that the variance of its third differences is as small as possible. He 
showed that up to a factor of proportionality we must have Kj = [(h + l) 2 — 
j 2 ] [(h + 2) 2 — j 2 ] [(h + 3) 2 — j 2 ] . Hence, the coefficients Kj given above define 
the (unnormalized) Henderson kernel. It can be shown that Kj minimize the 
sum of squared third order differences of the weights sequence, Wj. 
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3.1. Asymmetric filters and their automatic adaptation at boundary points. 
The derivation of the two-sided symmetric filter has assumed the availability 
of 2/i + l observations centered at t. Obviously, for a given finite sequence 
yt, t = 1, ... ,n, it is not possible to obtain the estimates of the signal for the 
(first and) last h time points, which is inconvenient, since we are typically 
most interested at the most recent estimates. 

We can envisage three fundamental approaches to the estimation of the 
signal at the extremes of the sample period: 

1. The construction of asymmetric filters that result from fitting a local 
polynomial to the available observations yt, t = n — h + l,n — h + 2, . . . ,n. 

2. Apply the symmetric two-sided filter w to the series extended by h fore- 
casts y n+ /| n , I = 1, . . . , h (and backcasts y\-i\ n ). 

3. Derive the best approximating filter which minimizes the revision mean 
square error subject to polynomial reproducing constraints. 

The second strategy (using forecast extensions) is safer, provided that 
we are capable of producing optimal forecasts according to some paramet- 
ric or nonparametric device, for example, by fitting a time series model of 
the ARIMA class. This idea is embodied in the X-11-ARIMA seasonal ad- 
justment procedure [Dagum (1982)]. An intuitive and easily established fact 
is that if the forecasts y n +i\ n are optimal in the mean square error sense, 
then the variance of the revision is a minimum; see Wallis (1983). In applied 
economic time series analysis most often extrapolations have a local linear 
nature, such as those obtained from ARIMA models with integration order 
equal to 1 or 2 (provided there is no constant term in the latter case). Re- 
cently, Dagum and Luati (2009) derived linear asymmetric filters based on 
data independent extrapolations from fixed ARIMA models and parameter 
values. When the forecast extensions are exogenous, the filter weights are 
adapted to the property of the series, so that the weights Wj are not fixed, 
but depend also on the ARIMA model for yt. 

The trend estimates for the last h data points, rh n _ h+ ^ n , . . . , m n | n , use 
respectively 2/i, 2h — 1, . . . ,h + 1 observations. It is thus inevitable that the 
last h estimates of the trend will be subject to revision as new observations 
become available. In the sequel we shall denote by q the number of future 
observations available at time t (the period which our estimate is referred 
to), q = 0, . . . , h, and by rh t \ t+(l the estimate of the signal at time t using the 
information available up to time t + q, with < q < h\ rh t \ t is usually known 
as the real time estimate since it uses only the past and current information. 

We now deal with the first strategy, which results from the automatic 
adaptation of the local polynomial filter to the available sample; we then 
interpret the results in terms of the other two strategies. The approximate 
model yt+j = m t +j + Et+j is assumed to hold for j = —h, —h + 1, . . . , q, and 
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Let us partition the matrices X, K and the vector y as follows: 



X 



X f 



y P 
y/ 



K 







K 



where y p denotes the set of available observations, whereas y/ is missing and 
X and K are partitioned accordingly. The direct asymmetric filter (DAF) 
arising as the solution to the above weighted least squares problem is written 
in matrix notation as 



(4) 



w 



Kp X p ( Xp K p X p ) 



ei. 



The filter resulting from the automatic adaptation of the local polynomial 
fit can be equivalently derived using the second strategy, assuming that 
the future observations are generated according to a polynomial function of 
time of degree d, so that the optimal forecasts are generated by the same 
polynomial model. Under the local polynomial model the forecasted values 
of y f are 

y/ = x /( x P K P x p)" lx p K py P - 

Applying the two-sided filter w to the observations extended by the forecasts 
yields 



w 



y P 



:ei(X'KX) _1 X'K 



y/ 



m t\t+q 

using X'K = [X p Kp,X' / K / ], 

(X'KX)" 1 = (X.' p K p X p + X' f K f X f )- 1 

= (XpKpXp)" 1 [I + X' / K / X / (X p K p X p )- 1 ]- 1 
and replacing yj gives 

rht\t+q = ei( X p K P X pr lx p K pVp> 

which is also the estimate of the intercept of the polynomial that uses only 
the available information. Hence, the asymmetric filter weights that are au- 
tomatically adapted at the boundaries are given by (4). 

The explicit expressions for the weights of the DAF, w , are derived below 
for d < 3, based on (4) and on matrix inversion formulae. Setting S qr = 
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Y^j=-h? K j f° r Q = 0, ■ ■ ■ ,h, the solutions for d = 0, 1, 2 are, respectively, 



Q ' a J 3 Q Q Q2 ■ 

>~>q0 >J q oD q 2 — O q i 



W •= • ^4,2 ~ jC4,l + j 2 C3,l 

1 S q oC\,2 — SqiCi,! + <Sg2C3,l ' 

j = -q,...,h, where Cm,n. = S qm S qn - S q , m -\S q , n -i- For d = 3, 

^ Z - Zij + Z 2 j 2 - Z 3 j 3 . 

(5) w a j = Kj— — j = -q,...,h, 

where Z = 5^6,4 ~ £^6,3 + 5^5,3; %1 = _ (<SglC6,4 — Sq2(e,3 + 5^5,3); 
Z2 = Sql(&,3 — S q 2Ce,2 + 5^4,2, Z 3 = — (<SglC5,3 ~ S q 2(,b,2 + Sq%Qj>). 

The real time filters arise when q = in the above expressions. The sym- 
metric weights of the smoothing filter, w, arise instead when q = h in the 
above expressions. Replacing Shr — for r odd, we find 



Kj _ Sh^ j S t 



1,2 



3 c 1 3 3 c c c2 

JhO JhO^hi — £>h2 

for d = 0, 1, and d = 2, 3, respectively. 

The direct asymmetric filters can be alternatively derived with the repro- 
ducing kernel Hilbert space (RKHS) approach [see Berlinet and Thomas- Agnan 
(2004)]. In that context, the equivalent kernel of a linear estimator of order 
d can be obtained as Kd(t) = Rd(t,0)fo(t), where Rd(t,0) is the reproduc- 
ing kernel of a Hilbert space of polynomials up to degree d > 1 with inner 
product defined with respect to a density function fo(t). The reproducing 
kernel is so-called because it reproduces any function in the Hilbert space 
in the sense that (g, Rd(t, -))^ = g(t), Vt G T, g £7i, from which many infer- 
ential properties can be derived. Once fo(t) is chosen with finite moments 
Uq, u\, . . . , U2d, one way to obtain the associated reproducing kernel is by 
means of Hankel determinants [Berlinet and Thomas- Agnan (2004), Theo- 
rem 80], in that 

det(H°[l,x f ]) 
Kd{t) - det(HS) /0(t) ' 

where H d * s the Hankel matrix whose elements are the moments of fo(t), 
from uq to Ud in the first row and from Vd to V2d i n the last column, and 
H^[l,xj] is the matrix obtained replacing the first column of H d by the 
vector x t = [l,t,t 2 , . . . ,t d }' . In our discrete setting, choosing the (normalized) 
Henderson kernel Kj in place of the density fo(t), then u r = S qr for r = 
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0, . . . , 2d and the matrix becomes X p K p X p , so that the elements of the 
filter w a are given by 

det(X p K p X p [l, Xj ]) 



where X j = [l,j,j 2 , . . . ,j d ]'- The expression (6) is exactly the same that we 
would obtain by solving for (3q = rht the least squares equation 

(X' p K p X p )P = X' p K p y p , 

using the Cramer rule for the explicit solution of a linear system. In fact, 
setting b = X p K p y p , the first coordinate of the solution vector is 



det(X p K p X p [l,b]) 



det(X p K p X p ) • 
Given that b = J2 9 j=-h x j K jyt+j> then 

det(X p K p X p [l,b]) = det(X;K p X p [l, Xj ]) Kj y t+J 

j=-h 

and, therefore, 

. « det(XiK p X p [l,x s ]) 

™«=.E det{x , KpXp) -m +l , 

from which (6) follows. 

The above expression also holds for symmetric filters, arising when q = h, 
and for any choice of the kernel Kj , providing an alternative way to express 
both the trend estimate and the equivalent kernel of the linear filter resulting 
by weighted linear regression. 

3.2. Properties of the direct asymmetric filters. Let us partition the weights 
of the two-sided symmetric filter in two groups, w = [w p , w^]', where w p con- 
tains the weights attributed to the past and current observations and wj 
those attached to the future unavailable observations. Then, 

w p = KpXpCX'KX)"^! 

= K P X P (X;K P X P + X'jKfXf)- 1 *! 

= [K p X p (X p K p X p ) 1 

-K p X p (X;K p X p )- 1 X , / K / X / (X;K p X p + X , / K / X / )- 1 ]e 1 

= w a - K p X p (X p K p X p )~ 1 X' / w / , 
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as W/ = K f X f (X' p K p X p + X.' f K f X f )- 1 e 1 . 

Thus, we have the fundamental relationship which states how the asym- 
metric filter weights are obtained from the symmetric ones: 

(7) w a = Wp + KpXp(XpK p Xp) _1 X / f w / . 

Premultiplying both sides by X p , we can see that the asymmetric filter 
weights satisfy the following polynomial reproduction constraints: 

X p w a = XpWp + XyW/ = X'w. 

If the design of the time points is centered around the current time, then 
X'w = ei. Thus, the bias in estimating an unknown function of time has 
the same order of magnitude as in the interior of time support. 

We now show that the weights w a are the unique minimizers with respect 
to v of the following constrained problem: 

(v - Wp)'K~ 1 (v - Wp) s.t. XpV = X'w, 



mm 



where w = [w p , w^]'. The first order conditions give v = w p + KpX p Z, where 
I is a vector of Lagrange multipliers. Premultiplying both sides by X^ and 
replacing X p v = X'w gives I = (X p KpXp)~ 1 X'^wj, and replacing into the 
expression for v gives v = w a , as given by (7). 

Hence, the asymmetric weights w a minimize the weighted distance be- 
tween the asymmetric filter coefficients and the symmetric ones, where the 
weights are provided by the reciprocal of the kernel weights. This result is 
useful in order to compare the asymmetric direct filter with the class of 
asymmetric filters derived in Section 4. 

Figure 3 plots the weights of the direct asymmetric Henderson filter for 
q ranging from (real time filter) to h (symmetric Henderson filter), along 
with their gain when the bandwidth takes the value h = 6, producing the 
Henderson 13 terms moving average when all the necessary observations are 
available. The real time filter uses 7 consecutive observations and it is very 
much concentrated on the current observation. As a consequence, the gain 
behaves rather poorly, being close to one also at the high frequencies. 

Hence, our analysis reveals a sort of discontinuity in the behavior of the 
filter, when we move from q = (real time filter) to q = 1 (one future obser- 
vation is available) . The real time filter is unbiased if the series is generated 
by a cubic polynomial; however, the preservation of the bias properties is 
done at the expenses of the variance, which is very high, since most of the 
contribution to the trend estimate comes from the current observation. This 
can be explained by means of the following relation, that gives the leverage 
of the filter, that is, the weight attached to the observation taken at the 
same time we are interested in the trend estimate, 

det(M lj i) 



w a ,o = Koei(X'KpXp) ei = k 



det(X„KpX, 
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Fig. 3. Gain, phase and weights for the symmetric and asymmetric Henderson filters 
w a ; q is the number of future observations available for estimating the signal. 



where Mi i is the submatrix obtained by deleting the first row and column 
of XpKpXp. 

(i) For fixed values of d, the leverage decreases as long as the span of the 
filter increases. It is maximum for the real time filter (q = 0) and minimum 
for the symmetric filter {q = h). 

(ii) On the other hand, for fixed values of h or q the leverage expo- 
nentially increases if the degree of the fitting polynomial increases. It is 
minimum for d = and maximum for d = h. 
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In particular, w ai o = 1 for d = h. The latter equality can be proved by 
noticing the general fact that the h + 1th row of X p (last row when real 
time filters are considered), whose elements correspond to j r , r = 0, . . . ,d, is 
the vector e[ = [1, 0, 0, . . . , 0] . Given that K p is diagonal, it follows from the 
row-column matrix product that 

Mi,! = X fe+1)1 K ft+li/l+1 X fe+ i ) i, 

where Xj j and Kjj are submatrices obtained by deleting the ith row and 
jth column of X p and K p . If d = h, then X p and IL^+n are square matrices 
that have different dimensions but the same determinant, as it is immediate 
to see by calculating det(X p ) from the last row of X p with the Laplace 
formula. Hence, it follows from the Binet-Cauchy theorem that 

det(Mi,i) _ det(X / fe+11 )det(K fc+ i ife+ i)det(X fe+ i ; i) _ l 
det(X p KpXp) det(X' )det(K p )det(Xp) kq 

and, therefore, w aj o = 1. Since the filter reproduces polynomials up to the 
order d, w a j = for j = —h, . . . , — 1. This result holds also for symmetric 
and nearest neighbor filters, where the maximum value d can assume is 2h. 
Proof of (i) and (ii) is given in the Appendix based on a generalized version 
of the Binet-Cauchy theorem. Note that these relations can be verified by 
(5), up to d = 3, and by (6). 

Table 1 illustrates how w Qi o varies with the length of the asymmetric 
filters and the degree of the fitting polynomial. The values are calculated 
for h = 6 and d ranging from d = (constant trend) to d = 6 (six degree 
polynomial) and q ranging from q = (real time filter) to q = 6, which gives 
the symmetric 13 term Henderson filter. It is evident, and not surprising, 
that the impact, on the leverage, of the degree of the fitting polynomial is 
much greater than that of the span of the filter. Even for small values of the 
order of the approximating polynomial, the leverage of the real time filter 
results in greater than 0.5. 

We have alternatively evaluated the values of Table 1 using different ker- 
nels, such as the Uniform and the Epanechnikov [Epanechnikov (1969)], but 
the resulting real time filters are almost equivalent to those calculated with 
the Henderson kernel, and therefore not illustrated here. 

4. On a general class of asymmetric filters. We now consider a class of 
asymmetric filters approximating a given symmetric two-sided smoothing 
filter. The class depends on unknown features of the series, such as slope 
and curvature, which can be estimated from the data, and encompasses 
the so-called Musgrave's surrogate filters (1964). The latter, which will be 
discussed in Section 4.1, approximate the two-sided Henderson filter at the 
end of the sample and are a component of the well-known X-ll cascade 
seasonal adjustment filter. 
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Table 1 

Values of w a ,o for h = 6, different orders d of the local polynomial and different values of 
q ranging from q = (real time) to q = 6 (symmetric filter) 







d = 


d=l 


d = 2 


d = 3 


d = 4 


d = 5 


d = 6 


q 


= 


0.2457 


0.5856 


0.8356 


0.9552 


0.9925 


0.9994 


1.0000 


q 


= i 


0.1991 


0.3038 


0.3060 


0.4560 


0.7285 


0.9238 


0.9908 


q 


= 2 


0.1712 


0.2008 


0.2653 


0.4275 


0.4493 


0.5189 


0.7662 


q 


= 3 


0.1547 


0.1615 


0.2652 


0.3385 


0.3603 


0.5144 


0.5397 


q 


= 4 


0.1456 


0.1466 


0.2578 


0.2776 


0.3577 


0.4309 


0.4594 


q 


= 5 


0.1413 


0.1414 


0.2472 


0.2495 


0.3516 


0.3644 


0.4593 


q 


= 6 


0.1400 


0.1400 


0.2400 


0.2400 


0.3379 


0.3379 


0.4418 



The minimum mean square revision error strategy which is at the 
basis of the criterion (8) was originally proposed by Musgrave (1964). 
Gray and Thomson (2002) generalized this idea to the case of a series gen- 
erated by a local dynamic model. In this section we propose a different 
derivation of Gray and Thomson's result that is more general and clarifies 
some issues of the design of asymmetric filters, among which the connections 
with the DAF. We also provide an alternative expression for the asymmetric 
weights that is directly connected to Musgrave's result. 

Assume that the observations are generated as y = U7 + ZS + e, e ~ 
N(0,D), where U, Z are a suitable design matrix. We aim at determining 
the asymmetric filter v minimizing the mean square revision error subject 
to constraints. The constraints are specified as follows: U p v = U'w, where 
U = [Up, Uj]'. Assuming that [U, Z] is full column rank (usually, as it will be 
illustrated later, [U, Z] is a selection of the columns of X or it is coincident 
with X), and partitioning D = diag(D p , D/), the set of asymmetric weights 
minimizes with respect to v the following objective function: 

</?(v) = (v — Wp)'D p (v — W p ) +WjDjWj 

(8) 

+ [<5'(Zp V - Z'w)] 2 + 2Z'(U p v - U'w). 

The revision error arising in estimating the signal mt is rh t \ t — wit = v Vp ~~ 
w'y. Replacing y p = U p 7 + Z p S + e p , and y = U7 + ZS + e, and using 
U' v = U'w = 0, we obtain m t \ t — nit = (v'Z p — w'Z)<5 + v'e p — w'e, where 
e= [£ p ,e'f]'. Hence, the first three summands of (8) represent the mean 
square revision error, which is broken down into the revision error variance 
(the first two terms) and the squared bias term [S'(Z' p v — Z'w)] 2 . The vector 
I is a vector of Lagrange multipliers. 

Setting 



Q = D p + Z p SS'Z' p , 
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the first order conditions for the minimization problem can be written as 
follows: 

v = w p + Q^ZpdS'Z'fWf - Q l XJ p l. 
Premultiplying both sides for XJ' p and recalling U p (v — w p ) = UjWj, 

U} W/ = U'^ZpSS'Z'fWf - U^Q-^pi, 

we can express the Lagrange multipliers as a linear combination of the 
weights wj: 

I = -[UpQ^Up]- 1 ^ - V'pQ^ZpdS'Z'^Wf. 
Replacing into the expression for v yields 

v = w p + Q^ZpSS'ZfWf + Q-^UpQ-^p]- 1 ^ - U^Q^ZpM'Z^w/, 
and rearranging, 

(9) v = Wp + LU' f w / + MZp<5<5'Z' / w / , 

with 

M = Q" 1 - Q^UpfU^Q-^pJ-^Q" 1 , L = Q^UppU^Q^Up]- 1 . 

The matrices M and L have the following properties: UpM = 0, U p L = I. 
Alternatively, the solution can be written as follows: 

v = w„ + L*U' f Wf 
(10) 1 

+ KZ p dd'[I + Z' p KZ p SS'r 1 [Z' f - ZpDp" 1 Up(U;Dp- 1 Up)- 1 U / ]w / , 

whereL* = Dp 1 Up(UpDp 1 Up)- 1 ,R = Dp 1 -Dp 1 Up(UpDp 1 Up)- 1 U;D- 1 , 
so that UpL* = I, UpR = 0. The proof of the equivalence is direct. 

It should be noticed that the DAF arises in the case D = K _1 and U = X, 
so that the bias term is zero. When D = a 2 I and 5'(Z' p v — Z'w) = (no bias 
term) , an alternative equivalent derivation of the asymmetric filter approxi- 
mating the two-sided local polynomial filter is based on the constrained min- 
imization of the integrated squared modulus of the difference between the 
transfer function of the symmetric filter, denoted w(e~ luJ ) = J2j=-h w j e ~ luJ ^ 
where i is the imaginary unit, and that of the asymmetric filter, denoted 
v(e-' w )=£j=- g v i e- ,w J. 

In particular, the asymmetric filter weights solve the following problem: 

mini f |w(e— )-y(e~ w )\ 2 dw + £ \J j h vj - £ /w, 
I** 77 fc=0 \j=-g j=-h 

where r can be equal to 1,2, ...,d and is a Lagrange multiplier. This 
approach has been applied for the construction of a filter approximating 
an ideal low-pass filter; see Percival and Walden (1993), Section 5.8 and 
Baxter and King (1999) for an application to the measurement of the busi- 
ness cycle. 
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4.1. Musgrave asymmetric filters. Musgrave's asymmetric filters [Musgrave 
(1964), Doherty (2001), Quenneville, Ladiray and Lefrancois (2003)] are ob- 
tained in the particular case when the original two-sided symmetric filter is 
the Henderson filter and U = i, Z = [—h, —h + 1, . . . , h]', 8 = 5±, D = cr 2 I, 
that is, when U and Z are respectively the first and the second column of 
the design matrix X. 

It is nevertheless convenient for comparison purposes to reset the time 
origin and derive the filter under the equivalent design 

U = i, Z = [1,2,..., 2/i + l]', S = 6 U D = cj 2 I. 

It is assumed that a linear process yt = 70 + 5\t + £t, t = 1, . . . , n, E(et) = 0, 
Var(et) = a 2 , generates the observations, and that the asymmetric filter has 
to preserve a constant signal, that is, J2 v i = 1- 

Then, if M denotes the number of elements of Z p , h < M < 2h + 1, and 
we let H = 2h+ 1, 

1 H 
L * U / W / = T7 E w i' 

lyl j=M+l 

M + l 



j=M+l 



2 



HZpSS' — — 



5f (. M + l 
a 



As a result, the usual expression for vm, the ith element of the vector v, as 
presented in Doherty (2001), Findley et al. (1998) and Ladiray and Quenneville 
(2001), in terms of the elements of the vector w = {wui, j = 1, . . . , H}, is ob- 
tained: 

H 



1 \- 

WM+ M E w b1 

j=M+l 



(11) 



+ 6fr M + l\ J2f=M+iU-(M+l)/2)w m 



a 2 \ 2 J l + (5 2 /a 2 )M(M + l)(M-l)/12' 



for i = l,...,M. 

The ratio ^ is related to R = I/O, as % = 4/(-7ri? 2 ), where I is the ex- 
pected absolute difference of the irregular component and C is the expected 
absolute difference of the trend component. Assuming et ~ NID(0, a 2 ), \et — 
£t—i\ is half normal with expected value 4c/7r and C = 5±, if the underlying 
signal is a linear trend with slope 5\ . 
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The limit of (11) as (o- 2 /Sj) ^0 (or, equivalently, S 2 /a 2 



OO IS 



(12) 




M+ 1 \ Ej=M + iti - (M + l)/2)w b1 
2 J M(M + 1)(M-1)/12 



for i = 1,...,M. This expression corresponds to what we would obtain if 
the unavailable future observations were replaced by linear extrapolations 
formed from the available data. See also [Doherty (2001), Section 6]. 

4.2. The properties of the approximate filters. The approximate filters 
that minimize (8) raise a controversial point. The symmetric filter was de- 
rived from the assumption that the series behaves locally according to a 
polynomial of degree d. We seek to approximate this filter by changing our 
assumption about how yt is generated, postulating that it has been possibly 
generated by a lower order polynomial or that the asymmetric filter is only 
capable of reproducing a polynomial of lower degree. In one way or another 
we are denying the conditions under which the original smoothing filter was 
derived. However, it is clear from our previous discussion that the original 
motivation for introducing a new class of approximating filters was the fact 
that the direct real time filter delivers very volatile estimates; hence, we had 
to move away from the direct strategy of fitting the maintained polynomial 
to the available observations. Second, it is not implausible to assume that 
the behavior of the signal at the extremes is different from that in the inte- 
rior of the sampling design. An analogy can be drawn with cubic smoothing 
splines: the so-called natural boundary condition is such that the spline is a 
local cubic function of time inside the boundary and is linear outside. This 
is beneficial to the reliability of the real time estimates and of the forecasts. 
The strategy that is adopted in the approximation is very similar since it 
effectively amounts to reducing the order of the fitting polynomial. 

The merits of the class of filters (10), relative to the DAF, lie in the bias- 
variance trade-off. In particular, the bias can be sacrificed for improving the 
variance properties of the corresponding asymmetric filter. If U = X, that is, 
it is asked of the filter to be capable of reproducing a dth order polynomial, 
which is also the process generating the observations, the approximate filter 
minimizing </?(v) will not differ from w a (in the light of the result in Section 
3.2 they will coincide if D = K _1 ); as a consequence, the real time filter 
will be strongly localized, and it will suffer from the same limitations as the 
DAF discussed in Section 3.1, namely, its estimates will be characterized by 
high variance. 

When U is a subset of the columns of X, spanning a polynomial of degree 
d* < d, then we require that the filter is capable of reproducing a polynomial 
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of degree d*; if the observations are generated by a polynomial of degree 
greater than d* , a bias will arise, which depends on the value of S. However, 
the weights of the approximating filter will be more evenly distributed and 
the variance will be reduced. Thus, the overall mean square revision error 
may eventually be reduced if the actual signal is weakly evolutive. 

Figure 4 plots the gain and the phase function of the real time filter 
when h = 6, w is the two-sided Henderson filter, and U = i (the asymmetric 
weights have to satisfy the constraint v « = 1)> Z = [— h, —h + 1, . . . , h — 
l,h]', S = #i,D = <7 2 I. As the filter depends on the slope of the underlying 
signal through the ratio 5 2 /a 2 , we plot two limiting cases arising when the 
slope is negligible and when it is the dominating feature. The intermediate 
case is the well-known Musgrave surrogate real time filter for the Henderson 
with 13 terms, which rises when Sf/a 2 = 4/(3. 5 2 7r), with R = 3.5 being the 
value selected for the Henderson filter with 13 terms. The filter weights are 
displayed in the bottom right panel of the same figure. 

When the slope is negligible, b\ja 2 = (or, equivalently, R — ► oo, which 
arises either when the signal is constant and is devoid of the linear term 
or the signal is buried in a heap of noise), the optimal approximation to 
the Henderson two-sided filter features weights that are less dispersed and 
the gain decreases from 1 almost monotonically, as it ought to be expected. 
The individual weights of the real time filter, vo, • • • ,v^, are plotted against 
the value of S 2 /a 2 in the bottom left panel. As the linear signal is stronger, 
the dispersion of the weights increases and the gain becomes higher at each 
individual frequency, getting greater than one at the low frequencies. 

The class of filters (10) accommodates the case when the two-sided sym- 
metric filter is the Henderson filter and an approximation is sought such that 
for a locally quadratic underlying signal, yt+j = 7o + 7i j + <^2 j 2 + £t+j , £t+j ~ 
IID(0, a 2 ), and requiring that the approximating filter preserves a linear sig- 
nal, which is achieved by imposing the constraints J2 v j = ^■>J2j=-q v jJ = 
J2j=-h w jJ- In (8) we set 



U' 

and 



1 1 ••• 1 ••• 1 1 

-h -h + 1 ■■■ ••• h-1 h 



Z = [{-hy,{-h + l) 2 ,...,l,0,l,...,h 2 Y, S = 5 2 , D = cj 2 I. 

Hence, U consists of the first two columns of X and Z is the third column, 
and the filter weights depend on the curvature of the underlying signal via 
5 2 /a 2 . The filter v will be referred to as the quadratic trend-linear fit (QL) 
approximation to the Henderson filter. 

The first row of Figure 5 considers the real time QL filter and plots the 
gain, the phase and the individual filter weights for different values of 5% /a 2 , 
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Fig. 4. Gain, phase and weights for the real time filter minimizing the revision mean 
square error subject to J^v; = 1, when the observations are generated by a linear trend. 



which expresses the relative importance of the curvature of the signal. The 
rationale for this particular type of asymmetric filter is early detection of 
turning points, which are a quadratic feature of the signal; setting 82 /<J 2 to 
a high value, the optimal filter would weight more the current observation 
and detect a turning point more rapidly. Essentially, with respect to the 
Musgrave type of filters, the bias is reduced at the expense of the variance. 
It should also be noticed that the optimal filter for b\ja 2 = is coincident 
with the optimal filter derived under a linear trend signal and using the 
constraint J2i v i = with 5 2 /a 2 — > 00; compare Figure 4. 

Finally, the bottom panels of Figure 5 display the gain, phase and filter 
weights of the real time filter approximating the Henderson filter when the 
series is generated by a cubic polynomial, y t +j = 70 + 71J + 72J 2 + <^3j 3 + 
£t+j , and the weights have to satisfy the quadratic reproduction constraints 

Evj = 1, J2j=- g Vjj = Ej=-h w j3 and Evjj 2 = Ej=-h w jf- These filters 
will be referred to the cubic trend — quadratic fit (CQ) asymmetric filters. In 
this case U is a matrix formed from the first three columns of the X matrix, 



U' 



1 

-h 
h 2 



1 

-h+l 
-h + lf 



1 






1 

h + l 
(h + l) 2 



1 

h 
h 2 




Fig. 5. Gain, phase and weights for the real time filter minimizing the revision mean 
sguare error: QL and CQ filters. 



and 

Z=[(-/ l ) 3 ,(-/ l + l) 3 ,...,l,0,l,...,/i 3 ]', S = 5 3 , D = a 2 I. 

In this case the optimal filter depends on the parameter 5 2 /a 2 , which 
is a measure of relative inflexion. Again, the optimal filter for 5 2 /a 2 = is 
the same as the QL filter arising for 5 2 /a 2 — > oo; compare the top panels 
of Figure 5. As S^/a 2 — > oo, the filter is the same as the direct asymmetric 
filter of Section 3.1. 



5. Illustrations. In this section we provide illustrations concerning the 
use of the general expression (9) for the design of real time filters suitable 
for a particular time series. The reference two-sided filter is the Henderson 
filter and we estimate the bandwidth h by cross-validation. 

Let rh t \t denote the two-sided estimate of the signal at time t which does 
not use yt- The latter can be expressed in terms of the Henderson estimate 
of the trend using the central filter: 

m t \ t = ei(X'KX - K eie' 1 )~ 1 (X / Ky - n y t ei) 

( X ' KX ) _1 + \ fWYVi. (X / KX)- 1 e 1 e / 1 (X / KX)- 1 

1 — Ko e i (,-X/KXJ 1 ei 
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x (X'Ky - K 2/tei) 



— e / 1 (X'KX)- 1 (X'Ky - Ko y t ei) 



1 „ w 



1 — Wq 1 — WQ 



The leave-one-out, or deletion, residual can be expressed in terms of the 
trend estimate using all the observations: 



The cross-validation score is the sum of the squared deletion residuals: 



Conditional on the value of h we consider three classes of filters: 

Asymmetric LC. The asymmetric LC (Linear-Constant) real time fil- 
ter arises as the best approximation to the two-sided Henderson filter, as- 
suming that yt is linear and imposing the constraint that the weights sum 
to 1. Hence, U = i, the unit vector, and the asymmetric filter depends on 
Si; see expression (11). 

Asymmetric QL. The asymmetric QL (Quadratic-Linear) real time 
filter arises as the best approximation to the two-sided Henderson filter, 
assuming that yt is quadratic and imposing the constraint that the estimates 
are capable of reproducing a first degree polynomial (see Section 4.2). 

Asymmetric CQ. The asymmetric CQ (Cubic-Quadratic) real time 
filter arises as the best approximation to the two-sided Henderson filter, 
assuming that yt is a cubic function of time and imposing the constraint 
that the estimates are capable of reproducing a second degree polynomial 
(see Section 4.2). 

The three asymmetric filters depend on a single parameter, 5f/a 2 , i = 
1,2,3. For each we compute the value that minimizes the mean square re- 
vision error (MSRE), that is, the value for which Y^Zh+ii^t — ™t\t) 2 /( n ~ 
2 h — 1) is a minimum. 
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Fig. 6. Selection of a real time filter: Index of Industrial Production, Italy, branch DL. 
Source: Istat. 

5.1. Italian index of industrial production. Our first illustration deals 
with the Italian index of industrial production for the branch DL (Manufac- 
ture of electrical and optical equipment, Nace Rev. 1 classification). The se- 
ries is produced by ISTAT, the Italian National Statistical Office, and made 
available on the website www.istat.it. The index shows the evolution of gross 
production in volume terms and represents a key short term indicator, due 
also to its timeliness, being made available with a delay of 43 days after the 
end of the reference month. The data are collected monthly through a sur- 
vey of establishments with at least 20 employees that make up at least 70% 
of total production. The volume of production in month t is compared to 
the average production of the base year (2000 for the current release). This 
dataset is available as supplementary material [Proietti and Luati (2008)], 
along with the other time series used for our illustrations. 

The value of the bandwidth selected by cross-validation is h = 15; the two- 
sided estimates of the trend are displayed in the right top panel of Figure 
6. We next look for the best approximation to the Henderson filter within 
the three particular classes. For this purpose we estimate the values of the 
parameters 5f /a 2 , i = 1,2,3, using a grid search. The results are presented 
in the bottom left panel of Figure 6. The minimizers of the MSRE are 
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Fig. 7. Index of Industrial Production, Manufacture of electrical and optical equipment, 
Italy. Comparison of the real time estimates arising from three approximating filters and 
final estimates of the trend component. 



(5 2 /a 2 ) =0.103, (5%/a 2 ) =0.016 and (<5 2 /cr 2 ) = 0.003, respectively, for the 
LC, QL and CQ classes. As illustrated by Figure 6, the best approximation 

to the original Henderson filter is provided by the QL filter with (S 2 / cr 2 ) = 
0.016. We need the real time filter to be capable of reproducing a linear 
signal and to react somewhat, although not in full, to the curvature of the 
underlying trend. 

Figure 7 compares the real time estimates of the trend for the period 
January 2002-December 2006, rh t u, arising from the best LC, QL and CQ 
approximations. It it is clear that the LC filter is biased when the slope 
is substantial: the bias is positive in a recessionary period and negative 
in expansion. This is so since the filter can only preserve a constant, but 
will distort a local linear trend. The optimal QL approximation provides 
the best approximation since the real time estimates are closer to the final 
Henderson estimates. The CQ approximation tracks the data quite well, but 
the corresponding estimates are affected by higher variance, compared with 
the QL estimates. Similar considerations apply to the DAF estimates, not 
reported for brevity. For the class of economic time series that are usually 
considered, such as industrial production, the evidence definitively points 
out that the direct asymmetric filter produces the most inefficient estimates, 
due to the very high variance inflation. 
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Fig. 8. Selection of a real time filter: Assessment of order-book levels, Euro Area. Source: 
European Commission. 



5.2. Assessment of order-book levels. Our second illustration deals with 
the monthly assessment of order-book levels for the 13 countries constitut- 
ing the Euro area. The series is produced by the European Commission, 
Directorate General for Economic and Financial Affairs, which conducts a 
monthly survey of the industrial sector of the economies in the European 
Union. The survey is largely qualitative and is administered to a purposive 
sample of about 23,000 representative firms. The main questions refer to 
an assessment of recent trends in production, of the current levels of order 
books and stocks, as well as expectations about production, selling prices 
and employment. The survey question from which our series originates is 
whether over the past three months the firm's orders have increased, re- 
mained unchanged or decreased. Answers obtained from the surveys are 
aggregated in the form of balances, which are constructed as the difference 
between the percentages of respondents giving positive and negative replies. 
See European Commission (2007) for more details. 

The series is made available on the website http:/ /ec.europa.eu/economy_ 
finance/db_indicators and is plotted in the second panel of Figure 8. It pro- 
vides an interesting case study, since its dynamic behavior is highly cyclical. 
The sample period considered is January 1985-September 2006. 
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As it can be seen from the first panel of Figure 8, the value of the band- 
width parameter suggested by cross-validation is h= 11. The two-sided esti- 
mates of the trend resulting from the Henderson filter corresponding to the 
selected h value are plotted in the right top panel. The mean square revi- 
sion error for the three filters is plotted in the bottom left panel against the 

value of Sf/a 2 , i = 1,2, 3. The minimizers of the MSRE are (If/a 2 ) = 0.173, 

= 0.041 and (df/a 2 ) = 0.007, respectively. Overall, the MSRE is min- 
imized by the QL filter, which again is our preferred real time filter. The 
bottom right panel displays the revisions rh t u — rht for the optimal filters 
belonging to each subclass, showing that the latter are particularly large for 
the LC filter. 

Figure 9 compares the real time estimates of the trend component for 
the period January 2002-September 2006, arising from the best LC, QL 
and CQ approximations. The series is characterized in this period by the 
presence of several turning points and by the rapid alternation of different 
business cycle phases. The plot illustrates that, due to the asymmetry of the 
real time filters, all the real time trend estimates suffer from a displacement 
of the turning points along the time axis, also known as a phase shift; the 
best performance is, however, provided by the QL approximation. The LC 
filter can depart quite substantially both from the final trend estimates and 
from the actual series values, during the phases of steep recovery after a 
lower turning point; on the other hand, the asymmetric CQ estimates are 
too responsive to the observations and suffer from excess volatility. 

5.3. Housing starts. We conclude with a more detailed treatment of the 
series concerning the number of new housing units started in the US, consid- 
ered in Section 2 and depicted in Figure 1. The value of the bandwidth es- 
timated by cross-validation is h = 10 (the cross-validation score is presented 
in the top left panel of Figure 10), and, thus, the Henderson estimates of the 
trend, displayed in the top right panel, are based on 21 consecutive obser- 
vations. For the estimation (at the beginning and) at the end of the sample 
period, the best asymmetric approximation is provided by the QL filter with 

(g/^2) = 0.029. 

Figure 11 compares the three real time estimates of the trend component. 
Those yielded by the QL asymmetric filter provide the best compromise 
between flexibility and smoothness: they are indeed more flexible than the 
LC estimates, which is particularly advantageous during the last steep re- 
cession initiated in 2006, but far less volatile than the CQ estimates, which, 
on the contrary, are too sensitive to the influence exerted by the current 
observations. 

In conclusion, the evidence presented in this section illustrates that the 
proposal of designing asymmetric filters in a more general and flexible way 
helps estimating the underlying signal with greater accuracy. 
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Fig. 9. Assessment of order-book levels, Euro Area. Comparison of the real time esti- 
mates arising from three approximating filters and final estimates of the trend component. 




0,0 0.1 0.2, , OJ 0.4 1960 1970 1980 1990 200(1 



Fig. 10. Selection of a real time filter: US Housing Starts. Source: European Commis- 
sion. 



28 



T. PROIETTI AND A. LUATI 




Fig. 11. 175 Housing Starts. Comparison of the real time estimates arising from three 
approximating filters and final estimates of the trend component. 



6. Conclusions. The paper has considered the problem of estimating the 
trend of a time series in real time by means of local polynomial filters; we 
showed that automatic adaptation at the boundary fails due to the high 
volatility of the estimates. We thus evaluated the strategy of approximating 
a given symmetric local polynomial filter by minimizing the mean square 
revision error subject to different order polynomial reproducing constraints 
and by making certain assumptions concerning the nature of the underlying 
signals. Restricting our attention to three families of real time filters that 
depend on certain key features of the unknown signal, such as its slope and 
curvature, we proposed to estimate these key features from the available 
data, rather than taking a fixed filter. 

Our empirical illustrations concerned the minimum mean square revi- 
sion error approximation of the Henderson filter, a very popular local cubic 
smoother. They enable us to conclude that we can improve upon the well- 
known Musgrave asymmetric filters, which for the series considered suffer 
from very large revisions, especially in steep recessions and recoveries and 
around turning points. This evidence arises as a consequence of the fact that 
the filter is not designed to deal with signals characterized by strong slope 
and curvature. 

We also considered the strategy of building either direct or minimum 
revision mean square approximations using the same fixed number of ob- 
servations (nearest neighbor bandwidth), rather than a fixed bandwidth. 
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The nearest neighbor bandwidth, proposed by Cleveland (1979), has certain 
advantages over the fixed bandwidth, and, in particular, when the observa- 
tions become sparse. In fact, it can be shown that the minimum mean square 
asymmetric approximation has better theoretical properties than the fixed 
bandwidth counterpart, but its effectiveness in ameliorating the approxima- 
tion to the Henderson filter was not proven by our empirical applications. 



APPENDIX 

When d < h (or d < 2h in the nearest neighbor case, where q = h + 1 
is fixed) and h + q + 1 is the asymmetric filter length varying with q = 
0, . . . , h— 1, the generalized Binet-Cauchy formula can be used to determine 
how the leverage varies with d or h: 

det(Mi, x ) _ det ( x 'h+i,i.^) det ( K h+i,h+u j )det(X h+hl7rr ) 



det(X^K p X p ) Ei/' |1) det(X: 7r .)det(K 7r .)det(X,..) 

(13) 

where X^. denotes a square submbatrix of X^ obtained taking all its rows 
and d+1 columns chosen on the set irj of the h + q + 1 columns of X^ and 

/h + q+l\ 

the summation Y^j d+1 ls extended to the i hJ ^[ 1 ) subsets of 1, . . . , h+q + 1 
with d+1 elements; ~K nj is the square submatrix of K p whose d+1 columns 
(and rows) correspond to those chosen for X'^ . The matrices X^ +1 1 ^ and 

Kft+i^+i are of dimension d x d. 

It is immediate to verify that, for q = d = 0, the ratio is equal to (J2j=o 
S'oq 1 . On the other extreme, for q = 0,d = h, we find the classical Binet- 
Cauchy formula for square matrices giving ratio equal to k^ 1 . In the interior 
< d < h the ratio (13) becomes 



(h+q\ 
d 



E det ( x ^ +1) ) 2det ( K ^ +1) , 



( h+ d q + V)-( h i q ) 
+ E det ( x ^ /(h+1) ) 2det ( K ^/ ( , +1) : 



where TTj^+i) indicates that only the submatrices of X^ or K p with the 
column h+1 are considered, while Tfj/Qi+i) indicates that the column h+ 1 is 
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not included. Since det(X' 7r 



±det(Xj l+1 x ), for even or odd values 



of d + 2, respectively, and det(K 7I . j . (fe+1) ) = k det(K h+ i jh+l7rj . ), then 

det(X^. {h+i) ) 2 det(K^. (h+1) ) = Ko El/ 9) det(X^ +1|1 . 5ri ) 2 det(K ft+1|h+lwi ), so 
that 



E det(X' ,, +1 J 2 det(K. 



T j(h+1)' 



( h+ d q ) 

E det ( x ^c, +1 )) 2det ( K ^c, +1 



k d+1 J 



1 -1 



+ E MX' J 2 det(K. 



The above expression enables to write (13) and, consequently, Wq as a 
function of determinants which are positive and refer to matrices having 
the same dimensions, d+1 xd+l. Hence, it follows that wg increases (de- 

\2. 



decreases 



/h+q+l\_/h+q\ 

creases) if the value ; 1 d ' det(X^, /(h+i) ) 2 det(K 7r . / 

(increases). The latter is made of positive terms, so that it is sufficient 
to evaluate how this number of terms varies by varying d,q,h to determine 
how wg varies accordingly. Given that d < h and q > 0, then h + q > d + 1 
and, therefore: 

(i) for fixed d, an increase in q or h implies an increase in that is, 
a decrease in wg; 

(ii) for fixed h, an increase in d implies a decrease in {^f), that is, an 
increase in wS. 
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SUPPLEMENTARY MATERIAL 

Datasets (DOI: 10.1214/08-AOAS195SUPP; .zip). The supplementary ma- 
terial contains the time series used to illustrate the methods. The series are 
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the following: assessment of order-book levels, housing starts: total: new 
privately owned housing units started, and index of industrial production, 
branch DL, manufacture of electrical and optical equipment 
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